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Abstract.  We  formulate  and  analyze  parallel  preconditioners  for  systems  of  equations  arising  from 
the  p-version  finite  element  method.  Using  new  theoretical  results  fot  polynomial  spaces,  we  prove  that 
condition  number  grows  as  log'  p,  where  p  is  the  degree  of  the  polynomial  space.  Numerical  results  are 
presented  showing  that  the  condition  number  indeed  grows  very  slowly  with  p. 

Key  words.  p-Version  Finite  Element  Method,  Preconditioning,  Domain  Decomposition,  Parallel 
Computation,  Polynomial  Sobolev  Inequality,  Polynomial  Extension  Theorems  __ 

1.  Introduction.  In  this  paper,  we  study  fast  parallel  preconditioners  for  systems  of 
equations  arising  from  the  p-version  finite  element  method.  The  p-version  finite  element 
method  (4,  5]  achieves  increase  of  precision  by  increasing  the  degree  of  elements  rather  than 
decreasing  their  size  as  the  h-version. 

The  finite  element  method  is  based  on  a  variational  formulation  of  the  original  problem: 

Find  u  such  that 


(1.1)  u  €  H  :  a(u,v)  =  f(v)  Vv  €  H 

where  H  is  a  Hilbert  space,  a(u,  v)  =  a(v,u)  is  a  bilinear  form  defined  on  H  x  H  and  /  is  a 
bounded  linear  functional  defined  over  H .  We  assume  that  a(u,v)  satisfies 

(1.2)  C"1||u||2w<a(u,u)<C'||u||27,  C>Q 

The  finite  element  method  consists  of  choosing  a  finite  dimensional  subspace  5  C  H  and 
posing  problem  (1.1)  on  S  x  S.  In  what  follows  the  bilinear  form  a(u ,  v)  is  understood  to  be 
on  5x5.  Selecting  basis  (shape)  functions  for  S  transforms  problem  (1.1)  into  the  problem 
of  finding  the  solution  of  a  system  of  linear  equations 

(1.3)  Ax  =  y 


where  A  is  a  positive  definite  symmetric  matrix. 

Our  basic  approach  to  solving  (1.3)  is  the  preconditioned  conjugate  gradient  method. 
We  construct  a  preconditioning  form  c(u,  v)  such  that 


(1.4) 


to |  c(i  .  a )  <  a(u,  u )  <  TO2c(u,  u ),  0  <*  mi  <  m2, 


holds  for  any  u  £  S.  The  form  c(u,v)  is  also  chosen  so  that  the  problem  c(u,v)  =  g{v)  (for 
an  appropriate  linear  form  g(v)  and  a  chosen  set  of  basis  functions)  is  easier  to  solve  than 
the  original  problem.  We  show  that  the  relative  condition  number  m2/m,  grows  at  most  as 
fast  as  log2(p)  for  one  type  of  finite  element  space,  and  as  plog2p  for  another. 

The  solution  of  the  problem  with  the  bilinear  form  c(u,v)  decomposes  into  local  highly 
parallelizable  computations  and  the  solution  of  a  relatively  small  global  auxiliary  problem. 

We  study  two  different  methods.  In  the  first  (Section  3),  the  global  problem  is  identical 
to  the  system  for  p  —  1,  which  presents  a  very  small  part  of  the  computational  cost  for  high 
p.  This  method  is  related  to  the  domain  decomposition  method  by  Bramble,  Pasciak,  and 
Schatz  [7]  for  the  h-version.  For  other  pertinent  considerations,  sec  Babuska,  Griebel,  and 
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Pitkaranta  [3],  Babuska  and  Elman  [1].  Babuska,  Elman,  and  Markley  [2],  and  Williams  [23]; 
for  numerically  computed  condition  numbers  and  relation  to  other  methods,  see  Mandel  [16]. 
For  another  related  method  using  preconditioning  by  elements  of  order  higher  than  one  for 
three-dimensional  elasticity,  see  Mandel  [15]. 

In  the  second  method  (Section  5),  the  global  system  has  one  variable  per  element, 
which  correponds  to  an  avarage  or  the  solution  on  every  element.  This  method  was  inspired 
by  the  methods  of  Dryja  [11]  and  Bramble,  Pasciak,  and  Schatz  [7]  for  the  h- version  in 
three  dimensions,  and  an  analogous  method  was  developed  by  Mandel  [14,  16,  17]  for  the 
p-version  in  three  dimensional  elasticity.  Note  that  the  first  method  leads  to  a  fast  growth 
of  the  condition  number  in  the  three-dimensional  case  [16] .  The  analysis  in  this  paper  could 
be  used  also  for  three-dimensional  problems  subject  to  availability  of  appropriate  extension 
theorems. 

The  paper  is  organized  as  follows:  For  the  reasons  of  clarity  of  presentation,  the  methods, 
their  analysis,  and  practical  results  are  presented  in  the  first  five  sections  of  the  paper,  while 
technical  auxiliary  results  are  given  only  at  the  end.  In  Section  2,  we  introduce  some 
notation  and  conventions.  In  Section  3,  we  formulate  and  analyze  the  first  preconditioner. 
Numerical  results  for  a  parallel  implementation  of  this  preconditioner  are  presented  in 
Section  4.  Section  5  contains  analysis  of  the  second  preconditioner.  Sections  6  and  7 
contain  auxiliary  results  about  Sobolev  norm  estimates  for  polynomial  spaces,  which  are  of 
separate  interest.  In  Section  6,  we  prove  a  discrete  Sobolev  inequality  for  polynomials  on  a 
segment,  and  bound  the  H^2  norm  of  a  polynomial  with  zero  boundary  values  in  terms  of 
its  H l//2  norm.  Section  7  contains  various  results  about  H 1  bounded  polynomial  extensions 
of  functions  defined  by  polynomials  on  the  the  boundary  of  a  triangle  or  a  square.  The 
theoretical  results  of  the  last  two  sections  are  related  to  the  results  of  Bramble,  Pasciak  and 
Schatz  [8,  10,  9,  7]  and  Widlund  [21,  22]  for  the  ^-version. 

We  would  like  to  express  our  appieciation  of  the  interest  and  comments  of  Professor 
Olof  Widlund  relating  to  the  results  of  this  paper. 

2.  Notation,  Conventions,  and  Preliminaries.  Let  5R2  be  two  dimensional 
Euclidean  space  and 

<3  =  {(«,'»)6»2llfl<i,W<i}, 

f  ={((,i|)€  |  0  <  1J  <  Vz(i  +  1),  -1  <  f  <  0,  or0<7<v1(l-{),0<£<l} 


be  the  reference  square  and  triangle  as  shown  in  figures  2.1  and  2.2,  respectively.  We  shall 
use  the  generic  notation  K  for  both  Q  and  T  when  the  distinction  is  unimportant. 

The  image  of  Q  (resp.  T )  under  the  mapping  Fq  (resp.  Fj),  Fq  :  Q  — >  Q  =  Fq(Q) 
(resp.  Fj  :  T  —*  T  —  F/(T ))  is  denoted  by  Q  (resp.  T ).  Similarly  to  before  we  use  the 
notation  K  for  both  Q  and  T  when  the  distinction  is  unimportant. 

We  shall  assume  that  the  mapping  Fq  is  a  bijection  and  that 


l^bll.oo.Q  <  Cj/lQ, 
\Jfq  lo.oo.y  —  CnJiyt 


! Fq]  li.oo.g  -  ^2 

\J f. lo.oo.y  ^ 
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(2.1) 


Fig.  2.1.  Reference  square 


where  J[Q  is  the  Jacobian  of  Fq,  J  f-i  is  the  Jacobian  of  Fq1,  and 

I^U.q  =  sup  Wd'FqU, 

(,n£Q 

Hl=* 

Similar  assumptions  are  made  about  F/ .  We  do  not  need  to  assume  anything  about  h.y  or 
hi ,  they  are  simply  numbers  proportional  to  the  diameter  of  K. 

Let  0  C  S2  be  a  cuivilinear  polygon,  that  is,  a  domain  which  is  bounded  by  a  simple 
curve  consisting  of  a  finite  number  of  smooth  arcs  with  the  end  points  at  the  vertices  of  fi. 
Further,  let  JC  be  a  decomposition  of  Q  into  a  finite  number  of  curvilinear  quadrilaterals  or 
triangles  such  that 

a)  fi  =  (J  and  f°r  all  decompositions  under  consideration  the  constants  C,  in  (2.1) 

are  the  same,  that  is  the  mappings  are  uniformly  bounded. 

b)  The  intersection  *  7^  j  is  either  a  common  vertex  or  common  side,  or  the 

intersection  is  empty. 

c)  If  K,f\Kj  =  7 ,j,  7,j  being  the  common  side,  then  the  mappings  Fj^'  and  Ff' 
coincide  on  7 in  the  usual  sense  of  the  finite  element  method. 

d)  The  vertices  of  coincide  with  the  vertices  of  some  K . 

In  this  paper  C  denotes  a  generic  constant  which  does  not  depend  on  p  or  any  of  the 
functions  involved,  but  which  may  take  different  values  in  different  places,  even  in  the  same 
formula. 

Let  us  now  define  the  finite  element  spaces  on  5  and  T.  By  p  >  1,  we  denote 

the  set  of  all  polynomials  on  Q  which  are  of  degree  at  most  p  separately  in  the  variables  £ 


Fig.  2.2.  Reference  triangle 


and  T}.  This  space  is  usually  called  the  space  of  tensor  product  polynomials.  By  Vp(Q),  we 
denote  the  span  of  the  union  of  set  of  all  polynomials  of  (total)  degree  at  most  <  p  with  the 
polynomials  which  are  of  degree  at  most  p  in  £,  degree  1  in  p  and  of  degree  1  in  £,  degree 
at  most  p  in  tj.  This  space  coincides  with  the  space  of  the  serendipity  element  [24].  The 
space  Vp(Q)  is  the  minimal  space  which  includes  all  polynomials  of  (total)  degree  p  and  has 
!••««)<!  fine*  in- •  vhirh  c?  n  he  fnrfn-ir*  '  i-'r.  new's* ,  cW*.-  r.n^  ir,*err»!  shape  function?, 
guarantees  good  approximation  properties.  In  [3j,  the  set  Vp(Q)  is  denoted  by  Qp  and  the 
set  Vp{Q)  by  Q'p.  By  Vp(T)  we  denote  the  set  of  all  polynomials  of  (total)  degree  at  most 
p,  and,  for  an  interval  /,  VP(I)  is  the  set  of  all  polynomials  of  degree  at  most  p  on  /.  The 
generic  notation  VP{K )  will  be  used  for  Vp(K). 

We  shall  assume  that  the  basis  (shape)  functions  of  Vp{K)  can  be  divided  into  the 
following  three  sets: 

a)  The  set  I  of  internal  shape  functions.  These  shape  functions  are  zero  on  dK. 

b)  The  sets  T,,  i  =  1, . . .  ,n  (n  =  3  for  K  —  T  and  n  =  4  for  K  —  Q)  of  side  shape 
functions.  If  7,  is  a  side  of  K  then  a  side  shape  function  associated  with  7,  is  zero 
on  dK  \  7,.  T,  is  the  set  of  all  side  shape  functions  associated  with  7,. 

c)  The  sets  A/),  i  =  l,...,n  of  the  nodal  shape  functions.  For  a  scalar  problem,  the 
set  Af,  contains  one  shape  function  which  has  value  one  at  v,  and  is  zertf  on  the 
opposite  side(s). 

The  spans  of  T ,  T,  ,  A ft  will  be  denoted  by  T,  T,,  //..  The  above  definition  does  not  imply 
a  unique  set  of  shape  functions.  There  are  many  different  ways  to  create  shape  functions 
satisfying  the  above  conditions  and  spanning  the  same  space.  We  refer  to  [3,  17]  for  details. 

Let  M  C  S2  be  a  domain.  Then  by  Hk(M ),  k  >  0  an  integer,  we  denote  the  standard 
Sobolev  space  on  M,  and  by  ||  •  \\k,M  (resp.  |  •  U,Af)  the  norm  (resp.  seminorm).  On 
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I  =  ( a,b )  we  also  introduce  the  Sobolev  space  with  fractional  index  by  interpolation  by 
the  K -method  [6],  H'/2{I )  =  H'  (/)) ,/2  with  the  norm  ||  -  ||i/2,/  and  the  space 

H'J,2  =  Ht\(I))  1/2  with  the  norm  „||  •  |jl/2>/.  Further,  by  H'/2(3K)  we  denote  the 

space  of  all  traces  of  u  £  H'(K )  and  ||uj|  {/2,ok  =  «*»f  1MI i ,k  where  the  infimum  is  taken  over 
all  v  £  H'{K)  for  which  v  —  u  on.  dK.  We  have 


\u\ 


1/2,/ 


u{x)  -  u(y) 
X  -  y 


1/2 


5 


and 


1 1/2,1 


Hl?/2f 


u2(x) 


(x  -  a) 


( b-x ) 


dx 


u 


2  r  u2(x)dx 

l/2J  +  J  (x  -a)(x  -  by 


where  ~  denotes  the  usual  equivalence  of  norms.  Let  7,  be  the  sides  of  K,  i  =  1,2,3  (resp. 
i  =  1,2, 3, 4)  and  v,  be  the  vertices  of  K  such  that  v,  is  common  for  7^,)  and  7/(.) -  Then  it 
is  well  known  that 


IM1 


2 

1/2  .dK 


n 


Y  IMIl/2/w  + 


^  }  (uj(,)(0  -«ho(0)  ^ 

h'  i«i 

1=1  U  1 


where  the  u3(i)  denote  the  restriction  of  u  to  7 j(i)  and  t  is  the  distance  to  v,.  It  is  also  well 
known  that  if  tr  £  H'l2(dK)  then  there  exists  U  £  H'(K )  such  that 


i*  L  ■>  •'  *  ,  ‘ 

In  this  paper  we  shall  be  interested  in  the  following  model  problem:  Find  u  such  that 

dxi 

(2.2)  —  An  =  /,  in  SI,  u  =  0  on  d'fl,  —  =  /2  on  d2fl 

on 

with  We  shall  assume  that  d'fl  ^  0  and  d'fl  and  d2fl  consist  of  entire 

sides  of  dfl.  Note  that  the  assumption  that  d'fl  ^  Q  is  only  for  the  sake  of  simplicity  and  is 
not  essential. 

We  shall  understand  problem  (2.2)  in  the  usual  weak  form.  To  this  end  let 

H  =  {u  £  H'(fl)  :  u  =  0  on  d'ft} 


and 

r 

a{u,v)  —  J  Vu  ■  Vvdxdy, 

n 


be  the  bilinear  form  defined  on  H  x  H.  Further  let 


g{v)  =  J  f\vdxdy  4-  J  finds 


d’n 
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be  a  bounded  linear  functional  on  H .  Then  by  the  weak  solution  of  (2.2)  we  mean  u{!  such 
that 

(2.3)  u, i  E  H  :  a(u„,v)  =  g(v )  Vv  E  H. 

The  solution  u()  exists  and  is  unique. 

The  finite  element  solution  is  defined  in  the  usual  way.  Let  1C  be  the  partition  of  Cl  and 
Vk={u(=H:  u\k  oFkE  VP(K),  K  E  K}. 

Then  the  finite  element  solution  is  a  function  uFF  such  that 

(2.4)  uFF  E  VK  :  a(uFF,v)  =  g(v),  Vv  E  VK. 

The  solution  of  (2.4)  exists  and  is  unique.  By  integration  over  the  elements  we  have 

(2.5)  a(u,v)  =  y~)  a k(ilk ,vK),  aF(u,v)  =  J  VuKXfvH  dx  dy. 

where  =  u\F  and  vh  =  v\F  are  the  restrictions  of  u  and  v  to  K .  Note  that  the  bilinear 
forms  aF{u,v)  are  positive  semidefinite. 

For  any  K,  E  1C  we  also  have 

aK(u,f)  =  aK(  uK,vK) 

where  u'-  --  u  .  Ff  E  l,, (A).  v,  -  vr  o  *'■  \'Vxh  j.  and  aK  is  defined  on  the  master 

element  K. 

The  form  aF  is  different  for  every  K  E  K.  but 

aF(uK  xuK)  ~  a(uK,uK), 

where 

(2.6)  a{uK,uK)  =  J \VuK\2dj]di 

K 

and  because  of  (2.1)  the  equivalence  constants  are  independent  of  Kx.  Hence  we  do  not  need, 
at  least  for  our  purposes,  to  distinguish  between  the  bilinear  forms  dF  and  a.  Further,  we 
do  not  need  to  make  any  distinction  between  the  basis  (shape)  functions  on  K  and  K ,  or  in 
general  between  K  and  K. 

3.  Preconditioning  by  Linear  Elements.  For  any  u  €  V*_  and  K  E  fC  we  have  the 
decomposition 

(3.i)  *K  =  fx.+ix.  +  «r 

i=i  i=i 
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where  uK  —  u\K ,  uR  t  £  Af,,  u£t  £  f,,  uR  £  I,  and  n  =  4  for  the  rectangle,  n  =  3  for 
the  triangle.  (As  mentioned  in  section  2  we  do  not  distinguish  between  K  and  K).  The 
partition  (3.1)  is  unique.  Define 


(3.2) 


where 


c{u,v)  =  J2  ck(uH  ,vk  ), 

X'gA. 


(3.3) 


(u\t,K)  =  a* 


+  1C  +  «K  («l*  -  *>*)■ 


The  main  goal  of  this  section  is  to  analyze  the  spectral  equivalence  of  the  forms  c(u,u)  and 
a(u,v). 

Lemma  3.1.  If 


m\af<;{uK  ,uh  )  <  ck{ur  ,uK)  <  m2af((u,<  ,vR  )  VK  £  AT  V«K  £  Vp(K), 


where  m.)  and  m2  are  independent  of  K  £  K  and  u,  then 


m,a(u,u)  <  c(u,u)  <  m->a(u,u),  Vu  £  V. 


Proof.  Sum  over  all  elements  K  £  K.  D 

The  following  lemma  shows  that  to  bound  the  condition  number,  it  is  enough  to  bound 
tbe  energy  of  the  terms  of  the  decomposition  (3.1). 

Lemma  3.2.  Lr  any  u  '  *_  ;  u. l 


(3.4) 

(3.5) 

(3^) 


uv.]\ ].K  -  &1  !uA  l?,AO 

J=< 

ksJi.K  <  3  =  !,•••, n, 

l*f  l?.K  —  ll.K»  .7  =  L  •  •  •  >  n- 


Then  it  holds  that 


(3.7) 


m, a^(nK,vK}  <  ck(uk,uk)  <  m2af<(uK  ,uh  ) 


with  m2jm\  <  (n  +  2)(fq  -f  nb2  4-  63). 

Proof.  We  have  from  the  Cauchy-Schwarz  inequality 


*k(uk,uk)  =  «(£<,  +  +  £<.  +»..) 

1=1  1=1  1=1  1=1 

<  (n  +  2)  (a*  (£«£„£«*,•)  +  £ «*(«£> “£,)  +  uf.) 


i=i  i=i 

K  ..K 1 


i=l 


=  (n  +  2)cK(u*,u«), 
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so  m |  >  1  / ( n  f  2 ) .  Further,  using  (3.4)-  (3.6),  we  obtain  c^-(uh  ,uh  )  <  (6,  +  nb2  +  6:i)|'uh  |(  K . 

□ 

Let  us  now  b^"nii  the  asymptotic  behavior  as  p  -*  oo  of  the  constants  b,  for  some 
particular  cas^';  ~-i  the  decomposition  (3.1)  genera*ed  by  specific  spaces  of  shape  functions. 
L-EMM  •  j.3.  Let  \P{k)  -  'Pp(jT)  or  P2(Q),  Af,  be  the  set  of  linear  functions  when 

K  ~  T  and  bilinear  when  K  =  Q,  and  for  all  i  =  1, . . .  ,n, 

(3.8)  a(u,u)  =  0  Vu  G  f  ,*■  G  T, 

where  a  is  defined  by  (2.6).  Then  (3-4)~(3.6)  hold  with 

b i  <  C(1  +  log2  p), 


where  C  is  independent  of  p. 

Proof.  Let  u  G  VP{K)  and  u  —  u  +  A,  A  G  3?.  Then  uVj  =  (Yff- 1  Kv(J)  +  A 

and  u9.;  =  u9j,  uj  =  uj.  Because  the  seminorm  |  •  |i.k  and  the  norm  in  the  factorspace 
H}  (K)  fPp(K)  are  equivalent,  we  may  assume  that 

IMIi.*  ^  C\u\\.fr 

Because  |M|I/2.,,  <  C||u||,  K,  applying  Theorem  6.2  we  obtain 
(3.9)  IluvJJ  K  <  ^(1  +  logp)|u|2  K. 

Thus  we  obtain  b,  <  C{  1  lo %p)  and  we  have  for  u ,  =  u  -  i  u''o  that 

)|U|||;_K  <  C(1  +  logp)„u,|-  K, 
and  using  Theorem  6.2  once  more  we  obtain 

NII?.»(;>K)  -  C(l  +  IogP)!lUlH|.K- 

Hence  by  Theorem  6.6, 

»ii«iii:,«.<c(i  +  i»8'p)Ni:.K 

for  all  sides  7,.  By  Theorem  7.4  if  K  -  T  and  by  Theorem  7.5  if  K  =  Q  there  exists 
u9  j  G  T,  +  I  such  that  u„ j  =  u{  on  7^  and 

Kj;.*  <  Colluillf/a.^  <  C{  1  +  log2  p)|u|2  K. 

From  condition  (3.8)  we  conclude  that  there  also  exists  uSiJ  G  Tj  such  that 

NjIIi.k  <  C(l  +  log2p)|u,  *| 

and  u9 ,,  =  u,  on  7,.  Therefore  we  obtain  b2  <  C(1  +  log2  p)  .  Obviously  uj  =  u,  -  £"=1  u9-J 
and  hence  63  <  C(1  +  log2  p)  as  well.  □ 
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LEMMA  3.4.  Let  Vp(K)  =  Pp(Q)  and  let  all  other  assumptions  of  Lemma  3.3  hold. 
Then  (3-4)-(3.6)  hold  with  b,  <  Cp2{  1  +  log2p). 

Proof.  The  proof  ic  analogous  to  that  above  using  Theorem  7.6  instead  of  Theorem  7.5. 

□ 

We  are  now  ready  for  the  main  result  of  this  section. 

THEOREM  3.5.  Let  the  assumptions  of  Lemmas  3.3  and  3-4  hold.  Then 

m\a(u,u)  <  c(u,u)  <  m2a(u,u) 

holds  for  any  u  £  14  with 

(3.10)  m2/mi  <  C(1  +  log2  p) 
if  Vp{K)  =  Vf(T)  or  VP(K)  =  Vp{Q),  K  e  K  and 

(3.11)  m2/m\  <  Cp2(l  +  log2  p) 

if  Vp(K)  £  Vp(Q)  f0T  some  K  £  K.  The  constants  C  in  (3.10)  and  (3.11)  are  independent 
of  p  and  K,  and  the  bound  (3.10)  cannot  be  asymptotically  improved. 

Proof.  Combining  Lemmas  3.1  -  3.4,  we  obtain  (3.10)  and  (3.11).  To  show  that  the 
bound  (3.10)  cannot  be  asymptotically  improved,  we  take  the  function  function  v  from  the 
second  part  of  the  proof  of  Theorem  6.2,  map  it  on  each  side  of  K  and  extend  to  a  function 
u  with  minimal  energy  in  VP(K).  Then 

W\\,K  ~  l°S  P, 

bu*  it  holds  for  the  side  components  that 

it  ~  o||v||?/2.((,.1r)  ~  logV 

J=l 

Consequently,  m2  >  (l/C)log2p;  but  it  is  easy  to  see  that  m i  <  1  -  □ 

Realizing  that  the  energy  of  the  nodal  components  was  bounded  by  a  Clogp,  the  above 
proof  also  shows  that  the  growth  of  the  condition  number  as  p  — »  oo  is  in  this  case  primarily 
due  to  the  coupling  between  adjacent  sides  (and  not,  for  example,  between  the  nodal  and 
side  shape  functions). 

Let  us  note  that  we  conjecture  that  (3.11)  is  too  pessimistic,  and  that  in  fact  the  result 
should  be  the  same  as  (3.10).  This  conjecture  is  supported  by  numerical  experiments. 

The  solution  operator  for  the  problem 

(3.12)  u  £  VK  :  c(u,v)  =  g(v),  Vn  £  14 

serves  now  as  the  preconditioner.  It  is  easy  to  see  that  problem  (3.12)  has  a  unique  solution, 
since  c(u,u)  is  symmetric  and  positive  definite. 

Using  this  preconditioner  we  can  apply  the  conjugate  gradient  method.  One  such 
modified  preconditioner,  which  is  very  natural  for  existing  p-version  finite  element  programs 
such  as  PROBE  [20),  will  be  studied  in  the  next  section.  Problem  (3.12)  is  obviously  much 
more  easily  solved  than  the  original  problem  o.nd  the  procedure  is  highly  parallelizable. 
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4.  Implementational  Aspect  and  Numerical  Experiments.  Theorem  3.5  is  the 
basis  for  various  versions  of  the  preconditioned  conjugate  gradient  method  which  can  be 
asymptotically  equivalent  yet  different  in  practical  performance.  For  various  aspects  we 
refer  to  [2,  1,  3,  14,  17,  16,  23].  The  method  has  the  following  essential  steps: 

a)  Construction  of  the  standard  set  of  shape  functions,  i.e.,  the  sets  A/’,,  I\,  X.  Here 
various  practical  considerations  play  an  important  role,  for  example  the  hierarchical 
character  of  the  functions.  For  the  discussion  of  the  design  of  Aft,  T,,  and  2’,  we 
refer  to  [3]. 

b)  Transformation  of  the  sets  T,  of  the  shape  lunctions  to  a  new  set  T*  which  satisfies 
(3.8),  while  preserving  the  span  of  all  basis  functions.  This  transformation  can 
be  based  on  the  standard  form  a(un,vK)  (and  hence  made  only  once  on  the 
reference  element)  or  on  the  actual  form  a^(uK )vK)  made  separately  (in  parallel) 
on  every  element.  The  transformation  can  be  made  by  elimination  (condensation) 
of  the  internal  shape  functions  and  in  the  latter  case  decreases  the  size  of  the 
global  stiffness  matrix  on  which  we  iterate.  The  transformed  shape  functions  are 
scaled  (normalized)  and  also  orthonormalized  in  T’.  Then  the  stiffness  matrix 
corresponding  to  the  preconditioning  form  c(tt,v)  is  diagonal  except  for  a  diagonal 
block  corresponding  to  p  =  1,  and  the  iterations  are  very  simple.  It  is  also  possible 
to  choose  basis  functions  on  the  reference  element  so  that  this  transformation  can 
be  avoided,  see  [17].  For  high  p  the  transformation  is  relatively  expensive  when  the 
actual  form  a(u,v)  is  used  but  is  fully  parallel  on  the  element  level.  Furthermore, 
the  transformation  approach  is  natural  for  an  existing  p-version  finite  element  code 
such  as  PROBE. 

c)  1  or  p  —  1  preconditioning  and  conjugate  gradient  iteration  either  ttie  global  stiffness 

matrix  can  be  assembled  or  the  iterations  can  be  made  using  local  stiffness  matrices. 
The  p  —  1  preconditioning  is  relatively  inexpensive  for  higher  p. 

Various  other  aspects  play  an  important  role  in  the*  practical  performance  of  the 
algorithm.  We  shall  not  enter  into  details  here  but  shall  display  a  numerical  example  based 
on  one  of  the  versions  of  the  method  and  implemented  on  Alii?  it  FX/85.  Let  us  consider 
problem  (2.2)  on  fl  =  (  —  1,1)  x  (  —  1,1)  with  the  partition  K  into  n  X  n  identical  squares. 
We  shall  further  assume  that  d2Q  —  dfl.  The  global  stiffness  matrix  is  then  singular  with  a 
simple  zero  eigenvalue  and  a  constant  eigenfunction. 

Let  us  first  consider  the  case  where  the  set  Vp{Q)  is  used  on  every  element  of  the 
partition.  This  set  is  used  in  the  program  PROBE  [5,  20 j . 

1)  The  one  element  sets  Af,  of  nodal  functions,  each  consisting  of  the  the  usual  bilinear 
function  N,,  defined  by 

*.({,<»)  =  -l). 

AW.l)  =  j(! +{)(!->)), 

5  Computational  support  was  provided  by  the  Advanced  Computing  Research  Facility  at  the  Argonne 
National  Laboratory. 
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W>«.7)=  ^(1  +«)(!  +1). 

«.«,>»)  =  j(l- 00+ ■>)• 

2)  The  sets  T,  of  the  side  shape  functions.  There  are  p  —  1  shape  functions  associated 
with  every  side  7 ,,t  =  1,2, 3, 4.  These  are  defined  as 

^(f.O'jfi  +  O^W.  *  =  1,2 . p  - 1, 

jyp'u.v)  =  ~(‘  +  *)*.({).  •  =  1.2 . .  -  1. 

j'f'K,-?)  =  hll(i  -{>$,(,), ;  =  1,2 . p-i, 

where 

*.«)  =  /  P,(t)dt 

-1 

and  Pj(t)  is  the  Legendre  polynomial  of  degree  j.  The  term  (  —  1)'  is  needed  in  jVjJ 
and  JV,^  to  obtain  invariance  with  respect  to  rotation  of  coordinates. 

3)  The  set  I  of  the  internal  shape  functions.  For  p  >  4  there  are  (p  —  2)(p  —  3)/2 
internal  shape  functions  defined  as 

(4.1)  KM,V)  =  (1  -  nu  -  n2)P>(i)PM,  0  <  (i  +  j)  <  p  -  4. 


For  example,  if  p  —  8  there  are  47  shape  functions,  consisting  of  4  nodal,  28  side  and  15 
internal  shape  functions. 

This  set  of  shape  functions  is  hierarchical,  which  is  important  in  practical  considerations. 
In  the  case  of  the  set  Vp(Q)  the  set  T  is  expanded  so  that  it  contains  the  functions  from 
(4.1)  for  all  i.j,  0  <  i,j  <  p  —  2.  See  [2,  3)  for  details. 

In  our  numerical  experiment,  we  use  the  following  approach: 

1)  Using  the  above  shape  functions  we  create  the  local  stiffness  matrix.  We  shall 
simulate  the  general  case  where  the  local  stiffness  mai/ices  are  different  and  compute 
them  separately  in  parallel.  We  further  consider  two  variants: 
a)  Preconditioning  by  elimination  of  internal  shape  functions  and  diagonal  scaling 
of  the  resulting  reduced  matrix.  Because  this  reduced  matrix  is  in  fact  the 
stiffness  matrix  with  original  nodal  functions  and  new  side  shape  functions 
satisfying  (3.8),  this  corresponds  to  the  preconditioner  (3.3)  with  form  a  in  the 
second  term  replaced  by  the  form  corresponding  to  the  diagonal  of  the  reduced 
matrix.  To  implement  (3.3)  completely,  one  can  orthogonalize  the  new  side 
shape  functions  in  the  energy  inner  product,  but  the  condition  number  would 
be  further  reduced  only  slightly  [3,  16j. 
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Table  4.1 

Number  of  iterations  to  reduce  the  error  in  the  energy  norm  by  factor  of  10“  '  for  n  =  2  (4  elements). 


p 

Set  PSiQ) 

Set V%Q) 

Preconditioning 
by  elimination 
of  internal 
shape  functions 

No  elimination 

Preconditioning 
by  elimination 
of  internal 
shape  functions 

No  elimination 

2 

12 

16 

8 

8 

4 

17 

31 

13 

19 

6 

20 

41 

.6 

26 

8 

22 

51 

18 

32 

10 

24 

59 

19 

39 

12 

26 

66 

20 

45 

14 

27 

74 

21 

50 

16 

29 

79 

22 

54 

b)  No  elimination  is  made,  the  interior  shape  functions  are  scaled  to  obtain  ones 
on  the  diagonal.  This  corresponds  to  the  definition  of  the  decomposition  from 
original  shape  functions,  and  using  the  diagonal  forms  obtained  from  these 
shape  functions  to  replace  a  in  (3.3)  for  sides  and  interiors. 

2)  The  conjugate  gradient  method  is  used  with  preconditioning  (3.3)  without 
assembling  the  global  stiffness  matrix.  Except  for  the  solution  of  the  problem  with 
p  =  1  in  every  iteration,  all  computations  are  performed  element  by  element.  To 
measure  the  convergence,  we  shall  only  consider  the  case  with  zero  exact  solution 
and  random  Martins  vector.  We  have  run  our  tests  for  the  case  d]Q  =  0, 

which  leads  to  singular  stiffness  matrix  with  constant  eigenfuction.  The  process 
was  thus  adapted  by  including  orthogonal  projections  onto  the  complement  of 
the  nullspace.  Direct  solution  of  the  problem  for  p  =  1  was  done  by  band  LU 
decomposition  and  elimination  of  internal  basis  functions  by  full  matrix  Choleski 
decomposition.  The  modified  side  shape  functions  have  not  been  orthogonalized. 

Table  4.1  shows  the  number  of  iterations  required  to  reduce  the  original  error  (measured 
in  the  energy  norm)  by  a  factor  of  10-4.  We  consider  the  case  n  =  2  (i.e.  4  elements).  The 
results  of  the  previous  section  indicate  that  the  number  of  iterations  should  grow  at  most 
as  logp  in  the  case  of  the  set  V^(Q)  and  at  most  as  plogp  for  the  case  V*(Q),  with  the 
conjecture  that  the  growth  is  only  logp.  In  Fig.  4.1,  we  show  the  relation  between  p  and  the 
number  of  iterations  in  semilog  scale.  We  see  that  the  case  'P  '(Q)  needs  fewer  iterations  than 
Vp(Q),  although  the  proof  is  still  open.  In  both  cases  the  growth  is  logp  for  p  in  practical 
ranges.  (The  growth  logp  would  lead  to  a  straight  line  in  Figure  4.1).  Fig.  4.2  shows,  in 
loglog  scale,  the  growth  of  the  number  of  iterations  in  the  case  when  no  preconditioning 
by  elimination  is  made.  We  see  that  the  number  of  iterations  grows  about  as  p ^2.  This  is 
related  to  the  growth  of  the  condition  number  of  the  local  stiffness  matrix  as  O(p’). 

To  compare  the  practical  potential  of  both  varians,  we  have  to  realize  that  the 
number  of  iterations  is  not  solely  essential  for  the  effectiveness  of  the  method,  because 
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Table  4.2 

Timing  in  seconds  on  Alliant  FX/8  using  4  processors  for  n  =  2  (4  elements) 


1 

Set  Vl{Q) 

Set  V:\Q) 

Preconditioning 
by  elimination 
of  internal 
shape  functions 

No  elimination 

Preconditioning 
by  elimination 
of  internal 
shape  functions 

No  elimination 

2 

0.465 

0.018 

0.433 

0.576 

0.011 

0.548 

0.294 

0.010 

0.270 

0.293 

0.010 

0.267 

B 

0.673 

0.051 

0.610 

1.150 

0.040 

1.098 

0.463 

0.020 

0.436 

0.671 

0.019 

0.639 

6 

0.969 

0.248 

0.781 

1.791 

0.127 

1.651 

0.743 

0.063 

0.633 

1.131 

0.052 

1.062 

8 

1.762 

0.946 

0.805 

3.186 

0.327 

2.345 

0.852 

0.206 

0.663 

1.403 

0.116 

1.274 

10 

3.976 

2.857 

1.092 

6.237 

0.729 

5.483 

1.280 

0.556 

0.709 

2.142 

0.249 

1.879 

12 

9.263 

7.353 

1.864 

12.498 

1.420 

11.032 

2.173 

1.328 

0.829 

3.344 

0.468 

2.856 

14 

20.368 

17.088 

3.200 

24.498 

2.535 

21.875 

3.971 

2.866 

1.078 

5.500 

0.820 

4.655 

16 

41.082 

35.516 

5.414 

42.245 

4.201 

37.893 

7.250 

5.679 

1.534 

9.062 

1.346 

7.676 

Total  time 

Legend:  Local  stiffness  time 

Conjugate  gradients  time 
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Tabi.e  4.3 

Number  of  iterations  and  timing  in  seconds  on  Alliant  with  8  processors  for  the  set  V 


p 

n  —  4 

n  =  5 

n  —  6 

n~  7 

n  =  8 

3 

II 

o 

10 

11 

10 

10 

10 

0.313 

0.473 

0.455 

0.659 

0.754 

l 

0.032 

0.035 

0.125 

0.252 

0.146 

0.263 

0.401 

0.298 

0.368 

0.471 

0.531 

14 

14 

14 

14 

14 

14 

0.458 

0.634 

0.726 

0.972 

0.990 

1.289 

4 

0.042 

0.071 

0.198 

0.322 

0.158 

0.307 

0.399 

0.537 

0.498 

0.613 

0.753 

0.896 

16 

16 

16 

16 

16 

16 

6 

0.643 

0.953 

1.112 

1.532 

1.614 

2.625 

0.133 

0.255 

0.334 

0.504 

0.506 

0.844 

0.491 

0.666 

0.698 

0.924 

1.049 

1.662 

17 

17 

17 

17 

17 

17 

8 

1.224 

1.740 

2.192 

2.844 

3.211 

5.110 

0.429 

0.824 

1.106 

1.538 

1.650 

2.732 

0.769 

0.877 

1.044 

1.260 

1.486 

2.265 

19 

19 

19 

19 

19 

19 

2.191 

3.740 

4.781 

6.592 

7.589 

11.664 

1.138 

2.268 

2.913 

4.064 

4.573 

7.412 

1.027 

1.431 

1.826 

2.468 

2.933 

4.140 

21 

21 

21 

21 

20 

20 

12 

4.536 

8.406 

10.788 

15.431 

17.701 

28.187 

2.747 

5.433 

6.876 

9.653 

10.930 

17.742 

1.746 

2.895 

3.832 

5.693 

6.632 

10.286 

22 

22 

22 

22 

22 

22 

14 

10.036 

17.871 

22.824 

31.812 

39.008 

61.259 

5.998 

11.854 

14.872 

20.779 

23.669 

38.391 

3.913 

5.845 

7.702 

10.710 

14.857 

22.123 

23 

23 

23 

24 

23 

23 

16 

18.902 

34.623 

44.622 

61.470 

74.892 

120.481 

11.931 

23.746 

29.920 

41.750 

47.458 

77.899 

6.761 

10.545 

14.232 

19.135 

26.547 

41.193 

Number  of  iterations 
Total  time 
Local  stiffness  time 
Conjugate  gradient  time 
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Full  line  is  the  number  of  iterations  for  the  space  V*,  dashed  line  for  the  space  Vjj. 

the  preconditioning  by  elimination  of  interior  is  expensive.  In  the  table  4.2  we  show  the 
timing  on  AUiant  FX/8  with  4  processors  (i.e.,  one  per  element).  We  report 

a)  the  total  time, 

b)  the  time  for  the  local  stiffness  matrix  computations,  elimination  and  scaling, 

c)  the  time  for  the  conjugate  gradient  method. 

From  table  4.2,  we  clearly  see  the  timing  of  the  main  parts  of  the  computation.  In 
the  case  where  no  elimination  is  made,  the  local  stiffness  matrix  time  consists  only  of  the 
matrix  construction  and  scaling,  while  in  the  case  of  elimination  it  also  includes  the  time 
for  elimination  which,  for  high  p,  is  the  main  part  of  the  total  time.  Comparing  these  times 
we  see  that  the  construction  of  the  local  stiffness  matrix  is  not  overly  expensive.  Further 
we  see  that  the  use  of  the  set  Vp(Q)  is  clearly  superior  to  that  of  Vp(Q).  The  set  Vp{Q) 
will  be  more  accurate  than  'Pp(Q)  for  the  same  p,  but  "Pp(Q)  has  more  basis  functions  and  a 
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Iterations 


Fig.  4.2.  Number  of  iterations  without  elimination  of  interior 


Degree  of  Elements 


Full  line  is  the  number  of  iterations  for  the  space  V*,  dashed  line  for  the  space  V*. 

greater  increase  in  accuracy  will  be  obtained  by  increasing  p  in  'Pp(Q)-  (For  example  p  =  11 
for  Vp(Q)  is  comparable  with  p  =  16  for  Vp{Q).) 

So  far  we  have  presented  the  data  for  the  case  n  =  2  (i.e.  4  elements).  Table  4.3  reports 
the  number  of  iterations  as  functions  of  p  and  n  for  the  set  'Pp(Q)  and  the  timing  as  in  table 
4.2.  We  see  that  the  number  of  iterations  is  independent  of  n  and  the  growth  with  p  is  the 
same  as  in  table  4.1.  We  report  here  condensed  data  only;  for  a  detailed  breakdown  of  the 
timing  and  an  exact  description  of  the  tests,  we  refer  to  [2,  23).  Here  we  mention  only  that 
the  total  time  does  not  equal  to  the  sum  of  local  stiffness  time  and  CG  time.  This  difference 
includes  the  time  for  the  LU  decomposition  for  p  =  1  as  well  as  various  communications  and 
bookkeeping  operations.  Note  in  Table  4.3  that  the  local  stiffness  time  for  large  p  is  almost 
proportional  to  the  number  of  elements  (see,  e.g.  p  =  16,  n  =  4,8),  while  for  low  p  other 
factors  prevail.  Because  in  our  model  problem  the  local  stiffness  matrices  are  identical,  we 
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could  compute  the  local  stiffness  matrices  only  once  by  each  processor.  The  local  stiffness 
time  would  then  decrease  by  the  factor  n2/ 8.  The  LU  decomposition  for  p  =  1  is  of  order 
3%  of  the  total  time  for  high  p  and  10%  of  total  time  for  low  p.  Finally,  we  mention  that 
for  p  =  16  and  n  =  10,  the  size  of  the  global  stiffness  matrix  (number  of  degrees  of  freedom) 
was  12521. 

We  conclude  this  section  with  several  remarks  based  on  detailed  computational  results 
as  reported  in  [23] .  The  method  is  well  parallelizable  and  the  observed  speedup  is  very 
high.  However,  the  speedup  is  different  for  various  parts  of  the  procedure.  The  local 
stiffness  matrix  computation  is  completely  independent  and  thus  the  speedup  is  more  than 
05%.  The  conjugate  gradient  iterations,  although  parallelizable  element  by  element,  require 
considerable  communication  between  elements  and  thus  the  speedup  is  smaller.  The  entire 
computation  has  speedup  of  order  at  least  85%,  depending  on  the  number  of  elements  and 
the  degree  p. 

We  expect  that  that  the  speed  up  will  be  essentially  the  same  for  parallel  computers 
with  distributed  memory  because  of  good  load  balancing.  The  stated  timings  also  allow 
us  to  roughly  estimate  the  times  for  other  variations  of  the  approach,  assuming  that  the 
same  number  of  iterations  are  required.  The  stopping  criterion  10-J  is  realistic  because 
the  discretization  error,  measured  in  the  energy  norm,  is,  in  practice,  larger  than  1%  and 
hence  an  additional  error  of  0.01%  is  fully  acceptable.  The  reported  limes  are  for  elements 
which  are  not  distorted.  We  can  expect  that  strong  distortion  of  the  elements  will  strongly 
influence  the  number  of  iterations  required;  see  Mandel  [17,  16)  for  related  condition  numbers 
in  three  dimensions  and  curved  elements. 

There  are  many  other  obvious  versions  and  variations  of  the  implementation.  We  have 
shown  one  possibility,  which  can  be  easily  implemented  as  a  part  of  an  existing  p-version 
code. 


5.  Preconditioning  by  an  Auxiliary  System.  The  method  of  Section  3  cannot  be 
applied  successfully  to  three-dimensional  problems  because  the  values  of  ||«v  jlli.K"  cannot  be 
bounded  by  a  power  of  logp  times  ||u||i,k-  independently  of  u  as  p  — *  oo.  We  shall  introduce 
another  preconditioning  system  which  has  been  successfully  applied  in  three  dimensions,  see 
Mandel  [17].  We  shall  analyze  the  analogous  two  dimensional  procedure.  In  order  to  analyze 
the  three-dimensional  procedure,  we  would  need  the  appropriate  extension  theorems  in  three 
dimensions;  the  proof  is  then  completely  analogous.  The  method  we  obtain  is  related  to 
that  of  Bramble,  Pasciak,  and  Schatz  [7]  for  the  h-version. 

For  u  G  V/t  and  if  €  1C  we  write  as  before  in  (3.1), 


(5.1) 


i=i 


i=i 


where  Uy  ,  G  A,,  us,,  ^  r>,  and  G  /•  The  definitions  of  the  spaces  Af,,  t,,  and  T  will  be 
different  here.  In  Section  3,  it  was  essential  that  the  span  of  A/”,,  i  =  1, . . .  ,n,  contains  the 
constant  functions.  This  is  no  longer  required  here;  instead,  we  tak®  <'»«•«*  of  th®  constant 
component  separately.  To  this  end,  let  A  be  the  space  of  functions  which  are  constant  on 
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each  K  G  AC.  Then  on  ( IT  x  A)  x  (V'v  x  A),  we  define  the  form 

d(u,\;v,p )  -  Y  dh  [uK  ,  AK  ;  vK,pH  ), 


dK(uK,\K-,vK^K)  =  d'K(uK  -\\v 


K 


K 


,  K  ,.K 


*), 


and 

d't<(u ,v)  =  Y  af<(uv,j’vv,j)  +  H  aK(u»,. j’v*j)  +  a><(ur  ). 

j  =  l  J=1 

The  solution  operator  for  the  problem 

(5.2)  u  G  V  :  d(u,A;u,/z)  =  g(v),  Vu  G  V^,  /i  €  A 

will  now,  analogously  to  (3.8),  define  the  preconditioner  (using  u  only).  Existence  of  the 
solution  operator  follows  easily  from  the  observation  that  d(u,  A;u,  A)  >  0  with  equality  only 
if  u  =  0  and  A  =  0.  The  system  (5.2)  obviously  splits  into 

(5.3)  <£*(«*  — A\1)  =  0 
and 


(5.4)  Y  dK(uK  ~  iv><)  =  9(v),  Vu  €  VK 

K€K 

See  [17]  for  more  details  about  the  solution  of  (5.3)  and  (5.4).  Now  we  have,  analogously  to 
Lemma  3.1, 

Lemma  5.1.  If 

7 nxaK{uK ,uK)  <  dK{uK ,\K{uKy,uK ,\K(uK))  <  m2aK(uK  ,uK) 
where  A K(uK)  is  defined  by  (5.3)  and  my  and  m2  are  independent  of  K  €  AC  and  u ,  then 

m,a(u,  u)  <  <f(u,  A(u);u,  A(u))  <  m2a(u,u). 


Further  we  have 

LEMMA  5.2.  Assume  that  for  any  uK  G  Vk  -such  that  f  uK dxdy  =  0,  tt  holds  that 


K 


I«?Xk  <  M«KIL.  \<Mk  < 


myaK(uK ,uK)  <  dK(uK,  A(uK);  uK ,  A(uK))  <  m2an{uK ,uK) 

IS 


(5.5) 
Then 

(5.6) 


holds  for  all  E  Vk  with  <  (2n  +  l)(nb i  -f  n.62  +  6;j). 

Proof.  '"he  proof  is  the  same  as  for  Lemma  3.2  observing  that  for  any  A, 

aK(uh  -  A ,uK  ~  A)  —  a/<(uh  ,  uh  ) 

and  using  the  fact  that  all  the  terms  in  (5.6)  are  invariant  with  respect  to  the  addition  of  a 
constant  to  u.  □ 

Let  us  now  determine  the  constants  6,  for  some  particular  choices  of  the  shape  functions. 
Lemma  5.3.  Let 

a)  V„(K)  =  Vi(T)  orV‘(Q) 

b)  For  any  u  E  N,,  i  =  1, .  .  .  ,  n,  and  all  sides  adjacent  to  the  vertex  v}, 

(5.7)  a(u,v)  =  0  ,Vt/E  lufy, 

c)  For  1  —  1, . . . ,  n, 

(5.8)  a(u,u)  =  0  Vu  E  f,,  Vv  E  T. 

Then  (5.5)  holds  with 

b,  <  C{  1  +  log2p),  t  =  1,2,3, 

where  C  is  independent  of  p. 

Proof.  First  consider  the  case  when  K  =  T  is  the  reference  triangle  with  the  vertices 
v,,i  =  1,  2, 3  and  sides  7,  ,  i  —  1, 2,  3  opposite  to  t>,  (see  Fig  2.2).  Let  u  E  Vr,(K),  f  ud(  dy  =  0; 

K 

then 

IMIi.tf  <  C\u\\.k 

By  Theorem  7.7  and  the  trace  theorem,  there  exists  a  function  /,  E  Vp[K)  such  that  /,  =  u 
on  7,  and  /,  —  0  at  v,.  Then  uv,,  =  u  —  /,  is  zero  at  7,  and  u(v,)  =  vv,,(v,).  Then  from  (5.7) 
we  have  |u^  f|,  K  <  |t5v.i | (  k  and  hence 

(5.9)  Iuv,iIi  k  ^  C\u\i 

giving  b 1  <  C. 

Now  for  U|  —  u  —  )T"_,  uv,7,  we  have  ||uij|,  ^  <  C||u||, and  using  Theorem  6.5,  we 
get 

u|l«i llT/2,7.  ^  C(1  +1og2p)||ui|;_K. 

The  rest  of  the  proof  for  this  case  is  analogous  to  that  of  Lemma  3.3. 

Let  us  now  consider  the  case  K  —  Q  and  the  vertex  t)  1  (see  Figure  2.1).  By  Corollary  6.3 
and  Theorem  7.9,  there  is  a  function  v  E  T*2  such  that  id7l  =  v|77  =  0,  u(vi)  =  u(t)|),  and 

IMIl.K  —  ^liUll  !,K- 
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Now  we  may  conclude  using  (5.7)  that 

IK.ilU  <  c\\u\\UK 

and  hence  b\  <  C  in  (5.5).  The  above  estimate  obviously  holds  for  uv.,,  i  =  2,3,4  as  well. 

The  proof  of  the  estimate  of  b2  and  fe3  goes  exactly  as  in  in  the  first  case  using  extensions 
by  Theorem  7.5.  0 

We  have  the  following  bound  for  the  tensor  product  space. 

LEMMA  5.4.  Let  VP(Q)  =  Vp{Q)  and  (5.1)  and  (5.8)  hold.  Then  |6,|  <  Cp2(l  +  log2  p). 
Proof.  The  proof  is  same  as  that  of  Lemma  5.6  for  K  =  Q  using  the  extension  by 
Theorem  7.6  rather  than  by  Theorem  7.5.  0 

We  are  now  ready  for  the  main  result  of  this  section. 

THEOREM  5.5.  Let  the  assumptions  of  Lemmas  5.2  and  5-4  hold.  Then 

rri[a(u,u)  <  d(u,  \(u);u,  A(u))  <  m2a(u,u) 


with 

(5.10)  m2/mi  <  <7(1  +  log2  p) 
if  Vp(T k)  =  VfaT)  or  VP(1K)  =  P2(7q)  for  all  K  £  K  and 

(5.11)  m2/m,  <  Cp2(l  -f  log2p) 

ifVp(K)  =  V) (Q)  for  some  K  G  K.  The  constant  C  in  (5.10)  and  (5.11)  is  independent  of 
p  and  K  and  the  bound  (5.10)  cannot  be  asymptotically  improved. 

Proof.  Combining  Lemmas  5. 1-5.4  we  obtain  (5.10)  and  (5.11).  The  proof  that  (5.10) 
is  sharp  is  same  as  in  the  proof  of  Theorem  3.5.  □ 

We  conjecture  as  in  Section  3  that  the  estimate  (5.11)  is  pessimistic  and  that  it  can  be 
in  fact  replaced  by  (5.10). 

6.  Polynomial  Subspaces  of  H ’/2.  In  this  section,  we  give  several  results  for  spaces 
of  polynomials  on  a  segment,  which  are  of  interest  in  themselves.  First  we  prove  a  discrete 
Sobolev  inequality  for  the  norm,  bounding  the  pointwise  value  of  a  polynomial  in 

terms  of  its  H '^2  norms.  Then  we  bound  the  H^2  norm  of  a  polynomial  which  is  zero  on 
the  endpoints  in  terms  of  its  norm.  In  what  follows  we  have  K  —  Q  or  K  —  T,  but  we 
use  the  coordinates  x,y  instead  of  (,tj. 

We  begin  with  a  lemma  which  will  allow  us  to  consider  trigonometric  polynomials  instead  . 
of  algebraic  ones.  Let  I  =  (  — 1,-fl)  and  7’  =  (0,7r). 

LEMMA  6.1.  Define  the  mapping  by 

(6.1)  u  6  H^2(I)  t— ►  Vu  =  v,  u(<p)  =  u(cosip),  <p  £  /*. 

Then  $  w  a  linear  homeomorphism  between  HX^2{I)  and  HX^2{I"). 
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If  u  €  Vv{I)  and  v  —  'F(u),  then 


v{ip)  =  XI  bk  cos  kip 


mi  e  «(*  + 1). 


Proof.  Define 

n  =  r  x  /, 

$  :  (v»,0  -»  (*,y),  *  +  *y  =  cos(y>  +  *0, 

Z  =  $(fi). 

Because  the  cosine  is  a  conformal  mapping,  the  Cauchy-Riemann  conditions  hold, 


=  yo  *c  =  -yv» 


and  we  have  the  Jacobian  of  <1>, 


J=*l  +  vl*  0,  V(v?,C)€D, 

because  —  sin(y?  +  *C)  —  cos'(v?  +  i()  =  iw-  4-  ty^  0  for  all  (<p,  C)  E  D.  Now  let  u  be  a  smooth 
function  defined  on  Z.  Define 

If'(u)  =  v(<p,()  =  u($(<p,()). 

Then  v  is  defined  on  Q.  From  the  chain  rule  and  Cauchy-Riemann  conditions, 
v pp  —  T  Uyy^,  vq  ~  uzxq  T  —  uxy^  -4-  UyX^,, 

and  by  a  simple  computation, 


V%  +  ^c2  =  (U*  +  »;)(xj  +  y2 )  =  (u2  +  u2y)X 


By  substitution, 


W\\,z  =  %)  dxdy  =  I (ul  +  u*)J  dipd(  =  J \vl  +  v\)  dp  d(  = 


Finally,  from 


M2,Z  =  f\u\2dxdy  =  J  \v\2Jd<pd( 
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and  the  equivalence  of  norms 

Ml?  .11  ~  ^lzll  M  +  J  V)\z\*  dtp  ,  z£H'(Q), 

which  holds  for  any  function  w  6  L^°(f2)  such  that  w  >  0  and  f(l  wdtpd(  >  0,  it  follows  that 

(6.4)  gll*(«)ll..n<ll«ll,.Z<C||*(u)||,.n, 

with  C  independent  on  u.  By  continuity,  (6.4)  holds  for  all  u  €  H'(Z). 

Now  if  u  £  Hlf2(I),  we  may  extend  u  to  u  £  H](Z)  so  that  IMI1.Z  <  <?IMIi/3./.  and  it 
follows  from  (6.4)  and  the  trace  theorem  that 

l|*(«)ll. /•->./•  <  C'||#(tt)||,.n  <  C\\u\U,z  <  C\\u\\U2J. 

The  inverse  inequality  follows  similarly  using  in  place  of  . 

To  prove  (6.2),  it  is  sufficient  to  note  that  we  have  cos n  <p  =  a*  cos  n<p  for  suitable 
ak.  Equation  (6.3)  follows  by  direct  evaluation  of  Sobolev  norms  of  a  Fourier  series  and  by 
interpolation.  □ 

The  following  theorem  is  a  discrete  Sobolev  inequality  for  polynomial  spaces. 
THEOREM  6.2.  Let  I  =  (  —  1,  Tl),  u  £  VP(I),  and  x  £  I-  Then 

M*)l  <  +  log,/2p)||tt||l/2./, 

with  C  independent  on  p,  it,  and  x. 

This  estimate  cannot  be  asymptotically  improved,  i.e.,  there  is  a  constant  C  such  that 
for  each  p  >  2  there  exists  up  6  VP{I)  such  that  ||i'p|||/2,/  <  C  and  |up(  — 1)|  >  log'^p. 
Proof.  By  Lemma  6.1,  we  can  consider  instead  the  case  v  £  Hl,2(I'), 

v 

v(t)  =  53  akCOs(p), 

k~i) 

and  if  £  /',  x  —  cosip.  Then  by  the  Cauchy-Schwarz  inequality, 

MV») I  <  53  la*l  =  E  KKfc  +  1)l/27TTTwi 
<  (tkf(Hl|)  1  <C|M|1/m.(1  +  Io6'«p). 

To  show  optimality  of  the  estimate,  it  is  enough  to  choose  ak  =  (k  f  l)-1  and  x  =  —  1.  □ 
Corollary  6.3.  We  have 

(6-5)  II u || (a*)  <  C{\  +log,/2p)||u||t,K,  Vu  e  V'P(K), 
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i  —  1,2  or  3,  where  C  does  not  depend  on  u. 

The  following  lemma  extends  the  results  of  Lemma  6.1  showing  that  the  mapping  *1' 
preserves  the  subspace 

LEMMA  6.4.  Let  ’P  be  defined  as  in  (6.1).  Then  u  <E  H,„fi(J)  if  and  only  if 
'I'(u)  £  H)Jfi(I'),  and  is  a  linear  homeomorphism  between  H'J,2(I)  and 

Proof.  Let  v  =  'F(u).  It  suffices  to  use  Lemma  6.1  and  to  note  that  we  have  for  the 

•  *  l  /  2 

additional  term  in  the  norm  that 


I 

-  I 


k2(*)l 

X2  -  1 


dx 


/ 


|u(cos-  p)\2 
I  cos  p  —  1| 


(—  sin  p)  dp 


<  c 

>  1/C 


|  ?  jrML 

)  l  V2(7r  -  1 P ) 


since 


sin  p 

1  —  cos2  p 


< C  \  1 

>  1/C  J  p{ir  -  pY 


p  e  (o,tt). 


□ 

Theorem  6.5.  Let  Zp  be  the  space 


zp  =  {u  e  v„{i)  |  u(-i)  =  u(  +  i)  =  0} . 


Then  for  all  p  >  0, 


(6.6) 


uiiuiil/2./  <  C(1  +  log  p)|!uj;  \/  }J,  Vu  6  Zp, 


with  a  constant  C  independent  on  p  and  u.  The  bound  (6.6)  cannot  be  asymptotically 
improved,  that  is,  for  every  p,  there  exists  a  function  v  £  Zp  such  that  o|M|i/2,//IMIi/2,/  -2 
Clogp,  C  >  0  independently  of  p. 

Proof.  From  Lemmas  6.1  and  6.4,  it  suffices  to  consider  instead  the  case 

p 

v(p)  —  a*  cos  kp ,  v(0)  =  0. 

k- n 


We  use  the  fact  that  v(0)  =  YPk= o  ak  =  0  and  estimate  by  the  Cauchy-Schwarz  inequality. 


But 


f  dp  -  j  -  ( afc(cos  kp  -  1  )Nj  dp 

l  V  l  f 


(cos  kp  -  1 )"’ 


k  f  1 


dp 


chWWEj 


p  '  (cos  kp  -  l)2 


*:=!  r, 


p(k  +  1) 


dp . 


n 

1 


(cos  kp  —  1)" 


kir 


dp  =  j 


(cos  ip  -  l)2 


dip  <  C(  1  +  log  k), 
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Hv>)i: 


A  logfc 


<  cimi?/2.,.  E  -^7  <  c\\v\\l/2J.c(i  +  log2 p). 


The  bound  of  the  integral  /  1 dip  when  u(7r)  =  0  follows  by  the  substitution  of  <p  for 


IT  —  if. 

To  show  that  the  bound  (6.6)  is  sharp,  let 


v(f)  =  EafcCOS  ktp 


'  1/k,  2  <  k  <  M  | 

ak  =  «  —0/k,  M  <  k  <  p  J 

0,  k  odd  or  k  =  0. 


k  even 


where  At  =  [y/p]  (integer  part)  and  /?  =  0(1)  is  chosen  so  that  E/t=u  a*  =  0,  i.e., 
v(0)  =  v(w )  =  0.  Now  expand  v  so  that 

OC 

v(<f)  =  E  s*ri  my?. 


Since  |H(2/2  (()  ss  logp,  it  will  suffice  to  show  that 


We  have 


oIHlW.*)  ~  E  mb2m  ^  ^logV 


(0,  m  even, 

|  E  JrpH,  ™  odd 

2<k<p 
k  even 

Then  6m  =  (2/tt)(6^>  -  /36^>),  where 

/,(!)_  V  2m  1  >,(2)  =  V  -  2-—~ 

m  2<feu  -2  ~  k2  *’  m 

Ac  even  A:  even 

The  function  x  t—>  j— decreasing  on  the  interval  (2,  m/ %/3) ,  increasing  on  the  intervals 
(m/\/3,7Ti)  and  (m,+oo),  and  positive  on  ^2, m).  Hence, 


/  *  \  /  r — >  2m  r — ^  2m 

" a  + (-"j  -  **>* 


'2<fc<[m/v/3)-l 
Ac  even 

,  (m/v/Sj-  1  _ 


/  V  2m  dx  j 

i  J  (m2  -  x2)x  J 


m-H  < A:<  A/ 
Ac  even 


2m  dx 


(m2  -  x2)x  (m2  -  (m  +  l)2)(m  +  1) 
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Using  the  fact  that 


(m2  —  x2)x  2m2  m 


2  _  -r2 


we  have  by  a  simple  computation  that 


b£>  >  <7>o,  4  <  m  <  M. 


Because  b$  <  0  for  m  <  Af,  we  may  conclude  that 


fcm  >  c 


log  m 


C  >  0,  4  <  m  <  M 


and  thus 


OO  l\/pi  1-  „  ~ 

S  mb m  >  C'  £  - —  >  C'  log1  p,  C  >  0, 

,  .  nr2 

for  all  sufficiently  large  p.  □ 

The  next  theorem  gives  a  different  bound. 

THEOREM  6.6.  Let  Zp  be  as  in  Theorem  6.5.  Then  for  all  p  >  0, 
olMlf/2,/  <  IMIl/2,f  +  C(1  +  logp)||u||i»(/). 

Proof.  It  is  sufficient  to  bound  the  additional  term  in  the  H^2  norm.  We  have 

/  v-^dt  +  [  vm*t. 

j  i-t  i  i-t  i  i-( 


i-i /p2 


By  Markov’s  inequality,  cf.,  [19], 


Klk°°(0  Cp2\\v\\L„{l), 


so  \v(t)\  <  C(i  -  0p2llvlk»(/)  and  we  obtain 


i  2,  »  l  i  >-'/p2  \ 

/  (l  -  Op'*  +  /  — 

-I  V-l/p2  -i  / 

^  ^  ^2  +  ^log^pj  II v II £.°°(/) ■ 


The  analogous  bound  on  /  dt  follows  by  substitution.  □ 
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Fig.  7.1.  Notation  scheme  for  the  triangle 


A  =  (0,0)  7i‘  Pi  1?  B  =  (1,0) 


7.  Polynomial  Extensions  from  the  Boundary.  Let  K  =  T  or  K  =  Q.  If 
/  6  Hl/2(dK),  then  there  exists  an  extension  F  €  FT!(fif)  so  t^iat  /  =  F  on  dK  and 

\\P\U,K  <  CWfWi/tw 

The  maun  result  of  this  section  is  that  if  /  is  a  polynomial  of  degree  p  on  all  sides  of  K , 
then  F  can  be  chosen  to  be  in  V'p  i  —  1,2  if  K  is  a  triangle  or  a  square,  respectively.  This 
extends  previous  results  from  Babuska  and  Suri  [4]. 

Let  us  consider  the  triangle  T  =  ABC  as  shown  in  Fig.  7.1.  We  denote 

7.  =  7i‘  U  7?  =  ~AP\  U  TTB  =  AB, 

72  =  72  U72c  =  AP2  U  P2C  —  AC, 

73  =  7:fU73C=^U^  =  m 
Let  /  €  Vp( 71).  Then  we  define 


(7.1) 


dt. 


The  value  of  F\  at  a  point  ( x,y )  6  T  depends  only  on  the  values  /  along  the  segment 
Q\Q2,  Q\  —  (x  —  -^,0),  Q2  —  {x  -^,0).  We  prove  now  the  following  lemma. 

LEMMA  7.1.  Let  f  €  Pp(l\ )  and  F\J\x,y)  be  defined  by  (7.1).  Then 


€  Vl(T), 
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(7.2) 


(73) 

f,U1(*,0)  =  /(*) 

(7.4) 

ll^'lli.r  <  CII/11,/,.7, 

(7-5) 

<C|l/lkv.  »<*<>. 

(7-6) 

<  cii/iu,,.  0  <  k  <  1, 

(7.7) 

l|f,Wll4.,c  <  CH/lk,,,  0<*<1, 

(7.8) 

Ill'll*.,?  <  Cll/llon,  0  <  k  <  1, 

where  the  constant  C  is  independent  of  p  and  f. 

Proof.  It  is  immediate  that  (7.3)  holds.  Let  f[x)  =  xn  with  0  <  n  <  p  integer.  Then 


=  /  3 tn  dt  — 


\/3 


n/3 


2y(n  4-  1) 


x  + 


v/3 


n+l 


-  * 


y 

v/3, 


n+l 


X  + 


2  y(n  +  1) 

1  ;/>„(*,»)  ePlfT). 


Vs)  (*  Vs) 


pn{x,y) 


(n  +  1) 

Hence  (7.2)  holds. 

To  prove  (7.4)  we  first  extend  /  to  a  function  defined  on  the  entire  z-axis  3?  so  that  [18] 


II/II./2,*<cii/ii1/2, 


+1’ 


where  we  have  used  the  same  notation  /  to  denote  the  extended  function  as  well.  Then  by 
(7.1),  F i(x,y)  is  well  defined  on  the  entire  half  plane  fl  =  {{ x,y )  |  y  >  0}.  For  (x,y)  £  fi 
we  have 


+oo 

(7.9)  Fi(x,yy=  I  f(t)H(x -t,y)dt  =  (/**(-,»))(*) 

— oo 


where 

(7.10) 


H(x,y) 


2  £ 
otherwise. 


Let  g(()  represent  the  Fourier  transform  of  the  function  g(x)  in  the  x  direction.  Then  by 
(7.9) 

(7.11)  m,v)  =  m)m,y) 


where 

(7.12) 


_J_v/3  V'f3  ,£z  1  sin(^y/y/3) 

v/27r  2y  !eaX^  iylV2 

-v/V5 
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Let  fi  =  {(£,y)|0  <  y  <  2}  and  calculate  the  H'(£l)  norm  of  Fi(x,y).  By  Parseval’s  equality, 
we  have  using  (7.11) 


'  i  II//*  («)  ~  II  •  II  /f t  (jj)  —  /  l/(0|2IW«.S)|2^  <*» 


+  /  \m\2\^H(S.,y)?dtiy  +  j  |/(f)|2l H((,y)\2d(dy. 


Now  letting  z  =  y£jV 3  we  obtain  by  (7.12), 

2  _  2f/v^ 


J  \H(t,y)\2  dy  =  —  j  VI 


sin2  z  dz  C 


< 


*2  \(\  ~  |*|  +  1 


Hence 


+oo 

(713)  f  \hW\(H((,y)\2d(dy  <C  j  |{||/(OIJ<<f  <  C||/llf/„<  C\\!\\\n,„. 


Also 


dy 

which  is  bounded  at  z  =  0.  Hence 


9  it,,  \  £  (cosz  sinz 


VOn  \  z  z2 


)■ 


J_  r  /cosz  sinzy 

\(\  J  \  z  z2  ) 


2Z/V3 


c 


1*1  +  1’ 


and 


J  dy<C\(\, 


so  that 
(7.14) 


I  \hi)\2\j-ym,y)\2di  dy  <  C  J  mm2dt  <  \\f\\]/2, 


71 


The  third  term  can  be  bounded  analogously.  Using  (7.13)-(7.14),(7.4)  follows.  Inequalities 
(7.7)  and  (7.8)  follow  immediately  for  k  —  0,  k  =  1  and  hence  by  an  interpolation  argument 
(see  [6] ) ,  they  hold  for  all  0  <  k  <  1. 

We  prove  now  (7.5).  Let  the  variable  x  be  used  to  represent  both  the  distance  from  A 
along  71  and  the  distance  from  A  along  72.  Denoting 

g(x)  =  -J  mu 

O 
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(7.15) 


it  is  readily  seen  that 

(7-16)  lld,lIU.v  =  l|C(x)lkf  l  =  (0,1/2). 

Using  (9.9.1)  of  [12],  p.  244  we  obtain 


(7-17)  l|G(*)lk;  <  C||/||„.,,. 

Further,  integrating  (7.15)  by  parts  we  have 


G(*)  =  /(*)-!  /  tf'(l)dt 

X  J 

u 


and  hence 


g'(x) = /'(*>  +  ~  j  = 

U 


u 


j  m  it. 

U 


Using  (9.9.5)  of  [12],  p.  245  with  r  =  2  we  obtain 


i  /<*  -moit 

o 


<  ciirik, 

u,/ 


and  by  (9.9.1)  of  [12]  p.  244  we  obtain 


Hence 


u,/ 


(7-18)  ||G'(x)||„t,  < 

Combining  (7.17)  and  (7.18)  we  obtain  (7.5)  for  k  =  0  and  A:  =  1  and  hence  by  the 
interpolation  argument  (7.5)  holds  for  all  0  <  k  <  1.  The  inequality  (7.6)  is  essentially 
the  same  as  (7.5)  and  Lemma  7.1  is  completely  proven.  0 

Let  now  /  =  /,  €  Vp{ 7«),  i  =  1,2,3.  Then  we  denote  by  -F,^(x,y)  the  polynomial 
extension  of  /,  into  T,  defined  for  t  =  1  by  (7.1)  and  for  t  =  2,3  by  (7.1)  after  properly 
rotating  the  coordinates.  Obviously  Lemma  7.1  is  applicable  for  i  =  1,2,3  when  properly 
interpreted  through  the  rotation  of  the  coordinates. 

Let  7i  and  be  the  two  sides  of  T  (see  Fig.  7.1)  with  the  common  vertex  A.  In  the 
sequel,  we  will  use  norms  of  the  form  |]  •  defined  by  ||«||fct7lU7j  =  ||u||jti7l  + 

for  k  <  1/2  and  for  u  continuous,  k  >  1/2  and 


!/2,7iU72 


U 


lU, 


+  llu!l 


2 

1/2.77 


+ 


l 

/ 


(u | (t)  -  U2(t))2 


dt 
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where  u,  is  the  restriction  of  u  to  7,,t  =  1,2  and  t  represents  the  distance  from  A  along  71 
or  72.  These  definitions  may  be  extended  to  the  case  of  different  sides  of  T  in  an  obvious 
way.  The  norm  ||  •  ||i/2,y7  can  be  defined  analogously. 

We  now  prove 

LEMMA  7.2.  Let  T  be  the  triangle  as  in  Fig.  7.1  and  f  be  continuous  and  such  that 

ft  =  f\  G  Vp( 7i),  i  =  l,2  where  by  f\  we  denote  the  restriction  on  f  on  7 Then  there 

17;  hi 

exists  4>,  G  Vp( 7,),  i  =  1,2  such  that 


(7.19)  U  =  F\*']  +  G  V'P{T), 


(7.20) 

U  =  f,  on  7,,  i  =  1, 

2, 

(7.21) 

IMI1.T  <  |I/H./2.7,U7,7 

(7.22) 

ll^i||fc,7i  —  ll/!!*mU7f?  1  t  =  1,2, 

0  <  k  <  1, 

(7.23) 

<c(ii/1ii*.,.+tn/>ik,; 

'  J=1 

),  0  <  k  <  1 

(7.24) 

ll*2ll*.,c  <  c(||/,||t.,c  +  £ (l/illo.,,) 

|,  0  <  k  <  1, 

where  C  is  constant  independent  of  p  and  f. 

Proof.  Let  <!>,  G  Vp(-y,).  Then  as  in  Lemma  7.1  we  define 

.  * 

Gt(x)  =  -  f  $i(t)dt,  i  =  l,2. 
X  J 

u 


Condition  (7.20)  will  be  satisfied  if 

X 

$«(*)  +  G2(x)  =  $i(*)  +  -  f  $2 (t)dt  =  /,(x) 

x  J 
0 

M*)  +  Gx(z)  =  *2(*)  +  -  [  $i(t) dt  =  f2(x) 

X  J 


(7.25) 


u 

hold  for  all  x  G  /  =  (0, 1).  Since  /,  G  Vp{ I)  it  is  easy  to  see  that  G  VP{I)  satisfying  (7.25) 
exist.  <£,  are  uniquely  determined  up  to  an  additive  constant  i  =  1,2.  We  now  define 


¥,(*)  =  $,(*)  + $2(*),  *2(*).=  *■(*)-  *2(«), 

{  1  M*)  =  /,(*)  +  f2(x),  h2(x)  =  /,(*)  -  f2(x) 

we  see  that  h\  G  H^2{I),h2  G  and  ..t||^2||i/2,/  <  ll/ll  1/2,7, ut,»  where  we  define 


ll/MI2 


i/2, 1 


~  PII?/2,7  + 


jMfdl 
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and 


aH'/2{I)  =  {u  €  H'/\I)  |  .4 ll^ll I /2,l  <  +oo}. 

Note  that  the  space  ,\H'^2(I)  is  obtained  by  interpolation  of  L2(I)  and  the  space 

aH'{I)  =  {ue  h'(I)  |  u(o)  =  0} 
as  .4 H'>2{I)  =  [L2(7),  aH'(I) j1/2  .  By  (7.25)  we  have 

X 

(7.27)  $'(X)  +  x  j^^dt  ~  M*)> 

0 

x 

(7.28)  *2(x)--  J<H2{t)dt  =  h2{x). 

o 

Here  'Pt(x)  is  unique,  'E'i(O)  =  ,  while  ^(i)  is  unique  up  to  the  additive  constant  K . 

We  first  analyze  (7.27).  By  differentiation  we  obtain 

V-4  / ¥,(*)<&  +  -*,  =/>,'. 

X  *  «c 

u 

Using  (7.27)  we  obtain 

(7.29)  +  —  =  /t',  +  — . 

X  X 

The  homogeneous  solution  of  (7.29)  is  1/x2.  A  particular  solution  can  be  found  by  using  the 
method  of  variation  of  constants.  Hence,  substituting  'P'i(x)  =  into  (7.29)  we  obtain 

T'(x)  =  h\x2  +  Si 


from  which 

X  X 

^i(x)  =  ^2  /  +  ~2  J  th\{t)dt. 

0  u 

Integrating  by  parts  we  obtain 

X 

(7.30)  ^fi(x)  =  hi(x)  -  ~  J  th\(t)  dt. 

u 

the  unique  solution  of  (7.27).  We  show  now  that 

(7.31)  l!*>lk,  <  cum*,/,  o<fc<i. 
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Let 


XX  X 

*•(*)  =  J  th\(t)  dt  =  —  J (x  —  dt  4-  x  J  hl(t)  dt. 


Then 


where 


1  1  1 
G(x)  =  J  (x  —  t)h{(t)  dt,  Q(x)  =  -  j  h\(t)dt. 

o  o 

Using  (9.9.4)  of  (12],  p.  245  with  r  =  2  and  (9.9.1)  of  [12], p.  244  we  obtain 
II F(x)/x%j  <  || G(x)/x%j  +  HQKu./  <  <7||Mkf 
which  yields  (7.31)  for  k  =  0.  Next,  differentiating  (7.30)  and  integrating  by  parts  we  obtain 

X  X 

=  h\  +  lf  th,(t)dt  -h  =  h\-^j  t7h\{t)dt. 


Let 


XX  xx 

F(x)  =  J  t2h\[t)dt  —  J (i  —  t)2 h\(t)  dt  —  x2  J  h[(t)dt  +  2x  J  th\{t)dt. 


We  have  then 


where 


S£l.  <3=1 


and 


1  1  1 
G(x)  =  j{x-tfh\{t)dt,  Q(*)  =  -/  h\{t)dt, 


x 

R(t.)  =  j  th\(t)dt. 


Iin*)/*3|l...<  <  l|G(*)/x3»„.-  +  ||«(x)||U  -  4-  ||fl(x)||„., 
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This  gives 


The  first  two  terms  can  be  bounded  once  more  by  C\\h\  j|„,/  using  (9.9.4)  of  [12],  p.245  with 
r  =  3  and  (9.9.1),  p.244.  Moreover, 


R(x) 


2 


X 

-  J{x  -  t)h\{t ) 


dt  +  x  J  h\{t ) 

0 


so  that  \\R\\0J  can  also  be  bounded  by  This  yields  (7.31)  for  k  =  1.  By  the 

interpolation  argument  (see  [12])  we  obtain  immediately  (7.31).  Let  us  consider  now  (7.28). 
Differentiating  it  and  using  once  more  (7.28)  we  obtain 

(7.32)  %=h'2  +  —. 

X 

Integrating  we  obtain 


(7.33) 


$2(2:)  =  Mx)  -  J  dt. 


which  is  the  solution  of  (7.28)  with  $2(1)  =  ^(l)-  We  wish  to  show  now  that 


(7.34)  ||*2lk/ <C.4|jM*./,  0<k<l. 

Using  (7.33)  and  (9.9.9)  from  [12],  p.  245  with  a  —  0  we  obtain 

l|*2|k/  <  C||MI»./- 


Since  /i2(0)  =  0,  (7.32)  yields 

X 

%  =  h'2  +  -  f  h'2{t)dt 

x  J 

0 

and  by  (9.9.1)  of  [12],  p.  244  we  obtain 

Mm  <  C'.JMli./- 

An  interpolation  argument  leads  immediately  to  (7.34).  Hence  we  have  constructed  solutions 
of  (7. 27), (7. 28)  such  that  (7.31)  and  (7.34)  hold.  We  note  that  for  k  =  1/2,  we  have  from 
(7.34)  ||'f,2||,/2,/  <  C.4 1| h2 1|  1/2,/  and  .4 1|^2||  1/2.A  cannot  be  replaced  by  ||h2||t/2,/.  Coming  back 
to  (7.26),  using  k  =  1/2  we  see  that  for  i  =  1,2, 

M  1/2,7.  $  ll/ll  1/2.7,  U77» 

and  applying  Lemma  7.1  we  obtain  immediately  (7.21)  and  also  (7.22).  Returning  to  (7.27), 
(7.28)  we  see  that  with  /*  =  (1/2,  1), 
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t  =  1,2. 


Hence  also 


l*.IU.i-<C  ||/,lk/-  +  Ell/;l 


In,/ 


=  1,2 


j=  i 


which  immediately  leads  to  (7.23),  (7.24).  □ 

The  following  lemma  is  taken  from  [5j. 

LEMMA  7.3.  Let  T  be  the  triangle  as  before,  f  be  continuous  on  dT ,  /2  =  /;i  =  0  and 
f ,  £  Fp( 7[).  Then  there  exists  a  polynomial  v  £  'Pp(T)  suc/i  t/iat 

IMIi.r  <  C'||/1||i,7l,  v  =  /,  on  7i,  v  =  0  on  72,73, 


where  C  is  a  constant  independent  of  f  and  p. 

THEOREM  7.4.  Let  T  be  the  equilateral  triangle  shown  in  Fig.  7.1.  f  is  continuous  on 
dT  and  f,  =  / |7|.  £  Vp{ 7,),  t  —  1,2,3.  Then  there  exists  U  £  Vp{t)  such  that  U  =  /  on  dT 
and 


||C/||i,r  <  C||/||1/2,aT 

where  the  constant  C  is  independent  of  p  and  f. 

Proof.  First  we  prove  the  theorem  for  the  case  when  /  =  0  on  72  and  7,  and  hence 
with  fi  =  f  |Tl  we  have  o||/i ||  1/2,7,  ^  II/H1/2, or-  By  Lemma  7.2  we  construct  4>.,<1?2  and 
U ,  =  Fl^  +  Fl*’1.  Then  U,  €  V, (T),  r/  =  /,  on  7,,  i  =  1,2  and 

(7.35)  IIC, II, .7  <  CH/,11,/7,,^  <  C||/||,/2ln7 


Denote  by  <73  the  trace  of  C/(  on  73.  Then  we  have 

o||<7:)||  1/2,73  ^  C|!/I|i/2,3T 

by  (7.35)  and  the  trace  theorem.  Because  of  (7.24),  H^lli^c  <  C||/||i/2,ar  and  hence  using 
Lemma  7.1  we  have  also 


(7.36) 

Let  now  analogously  as  before 


IM,.7C  <  C\\f\\i/2,gr 

U 2  =  F?'"]  +  F\*'\ 


so  that 

U2  £  Fp(r),  U2  -  93  on  73,  f/2  =  0  on  7, 

and 

(7-37)  ||£4||i,r  <  C ollflfajl  1/2,73  -  ^il/ll  1/2, or- 
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Denote  by  <72  '  the  trace  of  £/2  on  72.  Then  g^(A)  =  <72  *(C)  =  0-  Because  of  (7.36),  applying 
Lemma  7.1  and  Lemma  7.2  analogously  as  before  we  conclude  that 

£  C  (11*11,.,;:  +  ||*||,/2.„)  <  C ll/ll, /2,„. 

Now  applying  Lemma  7.3,  there  is  t/3  £  V'(T)  such  that 


(7.38) 


\U*\\i.t  <  <  C\\f\\l/3#T 


Let  now 


U\  =  3^l]  on  72,  Ui  =  0  011  71,73. 


V  =  Ux  -  U2  +  U3 


Then  it  is  easy  to  see  that  V  £  'Pp(T),  V  =  /j  on  71,  F  =  0  on  72,73  and  because  of  (7.35), 
(7.37)  and  (7.38)  we  obtain 

S  C||/l|l/2,aT 

which  concludes  the  argument. 

Secondly,  now  we  will  address  the  general  case.  By  Lemma  7.2  we  construct  $1 ,  $2  and 
U,  =  F< *’>  +  Fl*’1.  Then  F,  €  P'(T),  CF,  =  /,  on  1,,„  =1,2  and 

l|tf,!l,.r  <  Cll/ll  1/2,7iU72  ^  Cll/ll  1/2, ar- 

Denote  by  <73  the  trace  of  U  on  73  and  <7 3  =53-/3  on  73.  Then 

and  hence  by  the  first  part  of  the  proof,  there  exists  V  £  Vp(T),  V  =  53  on  73,  V  =  0  on 

7«,  *  =  1,2,  ||V||,  j-  <  0II5.3  II  i/2,7j-  Hence  taking  U  -  V  Cut  jin 

||f/||,.r  <  C||/||1/Ww 

and  U  =  /  on  dT .  This  concludes  the  proof.  0 

Let  5  =  [(z,y)  |  |x|  <  l,|y|  <  lj  be  a  square  and  71  its  sides  as  shown  in  Fig.  7.2. 

THEOREM  7.5.  Let  S  be  the  square  shovm  in  Fig.  7.2  and  f  be  continuous  on  dS  and 

such  that  f  =  /I7,  £  1  =  1, ...  ,4.  Then  there  exists  U  £  Vp{S)  such  that  U  =  f  on 

dS  and 


,5  <  CH/II  1/2,95 


where  the  constant  C  is  independent  of  p  and  f. 


Fig.  7.2.  Notation  scheme  for  the  square 


Proof.  Let  T  be  triangle  shown  in  Fig.  7.1  and  Q  =  {(£ , 77) |(^, g)  6  T,  17  <  be  the 
trapezoid  shown  in  Fig  7.3.  The  mapping 

,  .  ..  s  .  1  3x .  5.  .  ,3\/3 

(*,y)  *-♦  (£,»7),  £  =  2  +  I6^~y  +  3^’  Ti  =  (i  +  y)~fir- 

maps  S  onto  Q.  The  mapping  is  obviously  one-to-one,  smooth,  the  Jacobian  and  its  inverse 
are  bounded,  and  the  mapping  is  linear  on  dS.  If  U  6E  'Pp(T)  then 

U(t, v)=  £  ®*>fV  =  U(x,y)6V2v(S). 

0<k +j<p 

Because  the  used  mapping  is  smooth,  it  preserves  all  norms  under  consideration.  By  / 
we  denote  the  function  on  d Q  obtained  by  transformation  of  /  and  by  /,  we  denote  the 
restriction  of  /  on  7,.  First  we  construct  the  extension  of  /,  as  in  Lemma  7.1.  Hence  we 
can  replace  /  by  g^  where  =  0  on  one  side  of  S  and  ||^^'^ |(  1  /2.as  5:  (711/11 1/2, as-  We  can 

assume  that  we  achieved,  say,  <7^  =  0  which  leads  to  the  case  g\  =  0.  Extending  <7^  by  zero 

we  construct  as  in  Lemma  7.2  the  function  vt  =  <7^  on  7 ,  ^  =0  on  7.1  and 

Pi||i,t  <  ll/H  1/2, as- 

Hence  we  can  replace  /  by  p(2l,  gW  =  0  on  two  neighboring  sides  of  S  say  72,  73  and 
II? 2'||i/2,as  <  C||/|| i/2, as-  Repeating  once  more  the  construction  using  Lemmas  7.2  and  7.1 
we  replace  /  by  so  that  g PI  =  0,  say  once  more  on  72  and  73,  and 

IIS^!|,/2.7lU-n  <  <?ll/l|,/2.as,  <  C\mU2j,s,  <  C||/||,rt,S. 


Fig.  7.3.  Notation  scheme  for  the  trapezoid 


Repeating  once  more  the  process  and  using  Lemmas  7.1  and  7.2  we  reduce  our  original 
problem  to  the  extension  problem  on  5  when  ||pj^  !li/2,as  <  C||/||,/2,r)s  and  ]|sr!l'||i.7t  < 
£ ll/||i/2.as*  is  now  easy  to  construct  u2  £  Vl{S)  so  that  v2  =  <7^  on  dS  and 

\Mus<±\\gl%,. 

i=i 

which  leads  immediately  to  the  desired  result.  □ 

Theorem  7.5  is  concerned  with  the  space  "P2(5).  Theorem  7.5  does  not  hold  for  V'{S), 
but  there  is  a  weaker  statement  easily  available. 

THEOREM  7.6.  Let  the  assumptions  of  Theorem  7.5  hold.  Then  there  exists  U  £  V*(S) 
such  that  U  —  f  on  dS  and 

Ill'll, ,5  <  CpII/IW 

Proof.  By  Schmidt’s  inequality  [13],  ||/j|i,7,  <  C'p2||/||0,7l  and  by  interpolation, 

ll/ili.-r.  <  CP\\f\\l/2,as. 

Let  us  define 


and  analogously  U,,  i  =  2,3,4.  Then  V  =  £/_,  U,  is  continuous  on  dS  and  V  -  f  is  linear 
on  every  7,.  Since  ||(/,|]i,s  <  C||/,||  ii7i.,  we  have 

||V||1,s<C7p||/||1/2.a5. 
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Further  with  l,  —  V'|-,,  we  also  have 


II/-V.II,.,  <  cP||/||l/2.M. 

Let  U  be  the  bilinear  function  such  that  U  =  f  —  V  on  dS.  Then  also  ||t/||  <  C p||/|j i /2.o.h 
and  U  —  V  +  U  is  the  desired  extension.  □ 

Now  let  us  turn  to  extensions  where  only  some  boundary  values  are  given. 

^  Theorem  7.7.  Let  T  be  equilateral  triangle,  71  one  of  its  sides  and  v{  the  opposite 
verted  Let-  f  1  £  L>p(yi).  Then  there  exists  f  £  Vp{T)  such  that  f  =  /1  on  7i,/(«i)  =  0,  and 

^  ll/ll  l.T<C\\fi  || 

Proof.  By  Lemma  7.1,  there  is  F\  £  Vp{T)  such  that  F\  =  f\  on  7t, 

IIP, II, t  <C||/,||,„.„,  |F,(«,)|<C||/,||,,2,,1. 

It  suffices  to  put  F  =  f\  —  where  w  is  the  linear  function  such  that  tn(v, )  =  1  and 

w  —  0  on  71 .  0 

Here  is  a  similar  theorem  for  the  square. 

THEOREM  7.8.  Let  S  be  square  with  sides  7)  to  71  and  /j  £  Then  there  exists 

f  £  Vp ( S )  (resp.  Vp(S) )  such  that  f  =  f\  on  71,  /  =  0  on  the  opposite  side  7^,  and 

il/lli.s  <  C||/,||,/2.7I,  (  resp.  ll/ll,, 5  <  CpH/,11,/^) 

with  C  independent  on  f  and  p. 

Proof.  As  in  the  proof  of  Theorem  7.5,  transform  S  to  a  trapezoid  of  Fig.  7.3,  and  use 
Lemma  7.1  to  obtain  function  g  £  Vp(S)  such  that  g  =  f\  on  71,  and 

Wi.s<C||/,||l/,.,. 

If  follows  immediately  from  the  construction  (7.1)  that 

M,.„  <cil/llo.„. 

Let  5  be  as  in  Fig.  7.2,  i.e.,  71  characterized  by  y  =  —  1,  x  £  [—  1,+lj  and  73  by  y  =  1, 

x  £  f  —  1 ,  +1].  Let  v(x,y)  =  g{x)(y  4-  1).  Then  v  =  0  on  71 ,  v  =  g  on  73,  and 

IMks  <  c'llslk,,. 

Then  /  =  g  -  v  is  the  desired  extension  and  it  holds  that 

lb  -  H|l.S  <  Iblll.S  +  IM|l.S  <  C'll/l  II  1/2.7,- 

The  second  case  follows  from  the  first  one  using  Theorem  7.6.  □ 

Our  last  theorem  is  concerned  with  the  extreme  casf  of  extending  a  function  from  a 
vertex  onto  a  square. 


THEOREM  7.9.  Let  S  be  square  with  sides  71  to  71  and  and  A  its  vertex  adjacent  to  71 
and  7i.  Then  there  exists  a  constant  C  such  that  for  all  p  >  1,  there  is  a  function  v  £  Tf’(S) 
such  that  u(A)  =  1,  v  =  0  on  the  two  sides  72  and  72  which  are  opposite  to  the  vertex  A, 
and 


<  (7(1  +  logp)  1/2. 

Proof.  It  suffices  to  prove  the  theorem  only  for  all  sufficiently  large  p.  Mapping  the 
function  from  the  second  part  of  Theorem  6.2  onto  the  sides  71  and  71  so  that  the  point  —  1 
is  mapped  to  the  vertex  A,  and  extending  the  function  by  reflection  also  on  72  and  73,  we 
get  a  function  u  on  dS  such  that 

IM]  l/2,as  <  (7(1  +  logp)~1/2,  u(A)  =  1. 

Note  that  ||uj|i/2.t>s  =  ll“lli/2,-y,  (up  to  equivalence  of  norms)  by  symmetry.  The 

function  u  is  not  a  polynomial,  but  its  restriction  to  each  side  is  a  polynomial  function.  By 
Theorem  7.5,  there  is  an  extension  of  u  onto  5  such  that  u  £  T2  and  ||u||t,5  <  (7||ti|| i/2.as- 
Let  w  £  V  '  such  that  w(A)  =  1  and  w  =  0  on  72  and  73,  and  put  v  =  wu.  Then 

IM|i,s  <  C|M|i,5  <  <7(1  +  logp)"1/2,  u(A)  =  1, 

and 


□ 
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